RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/966,422 



DATE: 10/16/2001 
TIME: 15:52:20 
Input Set : A:\30534111.app

Output Set: N:\CRF3\10162001\I966422.raw

<110> APPLICANT: Feder, J. N.
      Mintier, G.
      Ramanathan, C. S.
      Hawken, D. R.
      Cacace, A.
      Barber, L.
      Kornacker, M. G.
<120> TITLE OF INVENTION: A NOVEL HUMAN G-PROTEIN COUPLED RECEPTOR, HGPRBMY6,
      EXPRESSED HIGHLY IN SMALL INTESTINE
<130> FILE REFERENCE: D0040NP/3053-4119US3
<140> CURRENT APPLICATION NUMBER: US/09/966,422 
<141> CURRENT FILING DATE: 2001-09-26 
<150> PRIOR APPLICATION NUMBER: 60/235,602 
<151> PRIOR FILING DATE: 2000-09-27
<150> PRIOR APPLICATION NUMBER: 60/306,604 
<151> PRIOR FILING DATE: 2001-07-19 
<150> PRIOR APPLICATION NUMBER: 60/315,412 
<151> PRIOR FILING DATE: 2001-08-28 
<160> NUMBER OF SEQ ID NOS: 81
<170> SOFTWARE: PatentIn Ver. 2.1
<210> SEQ ID NO: 1 
<211> LENGTH: 1683 
<212> TYPE: DNA 
<213> ORGANISM: Homo sapiens 
<400> SEQUENCE: 1



Does Not Comply
Corrected Diskette Needed



atggagactt attccttgtc tttgggtaat caatcagtgg tggaacctaa catagcaata 60 
cagtcagcaa atttctcttc agaaaatgcg gtggggcctt caaatgttcg cttctctgtg 120 
cagaaaggag ctagcagttc tctagtttct agttcaacat ttatacatac aaatgtggat 180 
ggccttaacc cagatgcaca gactgagctt caggtcttgc ttaatatgac gaaaaattac 240 
accaagacat gcggctttgt agtttatcaa aatgacaagc ttttccaatc aaaaactttt 300 
acagctaaat cggattttag tcaaaaaatt atctcaagca aaactgatga aaatgagcaa 360
gatcagagtg cttctgttga catggtcttt agtccaaagt acaaccaaaa agaatttcaa 420
ctctattcct atgcctgtgt ctattggaat ttgtcagcga aggactggga cacatatggc 480
tgtcaaaaag acaagggcac tgatggattc ctgcgctgcc gctgcaacca tactactaat 540 
tttgctgtat taatgacttt caaaaaggat tatcaatatc ccaaatcact tgacatatta 600 
tccaacgttg gatgtgcact gtctgttact ggtctggctc tcacagttat atttcagatt 660 
gtcaccagga aagtcagaaa aacctcagta acctgggttt tggtcaatct gtgcatatca 720 
atgttgattt tcaacctcct ctttgtgttt ggaattgaaa actccaataa gaacttgcag 780 
acaagtgatg gtgacatcaa taatattgac tttgacaata atgacatacc caggacagac 840 
accattaaca tcccgaatcc catgtgcact gcgattgccg ccttactgca ctattttctg 900 
ttagtgacat ttacctggaa cgcactcagc gctgcacagc tctattacct tctaataagg 960 
accatgaagc ctcttcctcg gcatttcatt cttttcatct cattaattgg atggggagtc 1020 
ccagctatag tagtggctat aacagtggga gttatttatt ctcagaatgg aaataatcca 1080 
cagtgggaat tagactaccg gcaagagaaa atctgctggc tggcaattcc agaacccaat 1140 
ggtgttataa aaagtccgct gttgtggtca ttcatcgtac ctgtaaccat tatcctcatc 1200 
agcaatgttg ttatgtttat tacaatctcg atcaaagtgc tgtggaagaa taaccagaac 1260 
ctgacaagca caaaaaaagt ttcatccatg aagaagattg ttagcacatt atctgttgca 1320 



Raw Sequence Listing Error Summary

ERROR DETECTED          SUGGESTED CORRECTION          SERIAL NUMBER: [illegible]

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY [illegible]



__ Wrapped Nucleics
   Wrapped Aminos

The number/text at the end of each line "wrapped" down to the next line. This may occur if your file
was retrieved in a word processor after creating it. Please adjust your right margin to [illegible]; this will
prevent "wrapping."

__ Invalid Line Length: The rules require that a line not exceed 72 characters in length.

__ Misaligned Amino
   Numbering

__ Non-ASCII

__ Variable Length

__ PatentIn 2.0

__ Skipped Sequences
   (OLD RULES)

__ Skipped Sequences
   (NEW RULES)

__ Use of n's or Xaa's
   (NEW RULES)

__ Invalid <213>
   Response

__ Use of <220>

__ PatentIn 2.0

__ Misuse of n



The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers;
use space characters, instead.

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please
ensure your subsequent submission is saved in ASCII text.

[illegible] residue having variable length [illegible]

A "bug" in PatentIn version 2.0 [illegible]

(2) INFORMATION FOR SEQ ID NO:X: [illegible]

(i) SEQUENCE CHARACTERISTICS: [illegible]

(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)

This sequence is intentionally skipped

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences.

[illegible] please insert the following lines for each skipped sequence:

<400> sequence id number

000

Use of n's and/or Xaa's have been detected in the Sequence Listing.

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if [illegible]

[illegible] Artificial Sequence [illegible] <213> response is Unknown [illegible]

(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules)



AMC/MH - Biotechnology Systems Branch - 08/21/2001



The Biotechnology Systems Branch of the Scientific and Technical Information
Center (STIC) detected errors when processing the following computer readable
form:

Application Serial Number:

Source:

Date Processed by STIC:



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS.
PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER:

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE
APPLICANT, WITH A NOTICE TO COMPLY or,

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A
NOTICE TO COMPLY



FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SP[illegible]

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax)
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax)

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER
VERSION 3.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW:



Checker Version 3.0 

The Checker Version 3.0 application is a state-of-the-art Windows-based software program
employing a logical and intuitive user interface to check whether a sequence listing is in
compliance with format and content rules. Checker Version 3.0 works for sequence listings
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual
Property Organization (WIPO) Standard ST.25.

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K-compliant.
Checker allows public users to check sequence listings in Computer Readable Form
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO).
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 
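
As an illustration of the kind of check such a program performs, the following Python sketch tests two of the rules cited in the error summary: a line may not exceed 72 characters, and a <220>/<223> feature section is required when the <213> response is Artificial Sequence or Unknown. It is not the USPTO Checker; the function name check_listing and the wording of its messages are hypothetical.

```python
import re

MAX_LINE_LENGTH = 72  # line-length limit quoted in the error summary


def check_listing(text):
    """Return (line_number, message) findings for a numeric-identifier listing.

    Hypothetical helper: it covers only the line-length rule and the
    <220>/<223> requirement for Artificial Sequence / Unknown organisms.
    """
    findings = []
    records = []   # one dict per <210> record
    current = None
    for number, line in enumerate(text.splitlines(), start=1):
        if len(line) > MAX_LINE_LENGTH:
            findings.append((number, "line exceeds 72 characters"))
        match = re.match(r"\s*<(\d{3})>\s*(.*)", line)
        if not match:
            continue
        tag, value = match.group(1), match.group(2).strip()
        if tag == "210":
            # A new sequence record starts at every <210> identifier.
            current = {"line": number, "id": value, "organism": "", "tags": set()}
            records.append(current)
        elif current is not None:
            current["tags"].add(tag)
            if tag == "213":
                current["organism"] = value
    for record in records:
        organism = record["organism"].lower()
        needs_feature = "artificial sequence" in organism or "unknown" in organism
        if needs_feature and not {"220", "223"} <= record["tags"]:
            findings.append((record["line"], f"<220>/<223> missing for {record['id']}"))
    return findings


sample = "<210> 16\n<211> 41\n<212> PRT\n<213> Artificial Sequence\n<400> 16\n"
for finding in check_listing(sample):
    print(finding)
```

The sample record declares an Artificial Sequence without a feature section, so the sketch reports it; a record that carries both <220> and <223> passes.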



Checker Version 3.0 can be downloaded from the USPTO website at the following address:

http://www.uspto.gov/web/offices/pac/checker






VERIFICATION SUMMARY DATE: 10/16/2001 

PATENT APPLICATION: US/09/966,422 TIME: 15:52:21 

Input Set : A:\30534111.app 

Output Set: N:\CRF3\10162001\I966422.raw

L:16 M:270 C: Current Application Number differs. Replaced Current Application Number 
L:1336 M:258 W: Mandatory Feature missing, <220> FEATURE:
L:1336 M:258 W: Mandatory Feature missing, <223> OTHER INFORMATION:
L:2066 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:67
L:2066 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:67
L:2066 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:67 

L:2067 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:67 
L:2067 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:67 
L:2067 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:67

L:2080 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:68
L:2080 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:68
L:2080 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:68 

L:2081 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:68 
L:2081 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:68 
L:2081 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:68 
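
Entries of this kind can be grouped per sequence with a few lines of Python. This is a sketch only: reading L: as the listing line number, M: as a message code, and the letter that follows as a severity flag is an assumption based on the layout of the summary, and group_by_sequence is a hypothetical helper, not part of Checker.

```python
import re
from collections import defaultdict

# Assumed entry layout: "L:<line> M:<code> <flag>: <message>"
ENTRY = re.compile(r"^L:(\d+)\s+M:(\d+)\s+([A-Z]):\s*(.*)$")
SEQ_ID = re.compile(r"SEQ ID#:(\d+)")


def group_by_sequence(lines):
    """Group summary entries by the SEQ ID they mention (None if absent)."""
    grouped = defaultdict(list)
    for line in lines:
        entry = ENTRY.match(line.strip())
        if entry is None:
            continue  # skip blank lines and anything that is not an entry
        listing_line, code, flag, message = entry.groups()
        seq = SEQ_ID.search(message)
        key = int(seq.group(1)) if seq else None
        grouped[key].append((int(listing_line), int(code), flag, message))
    return dict(grouped)


summary = [
    'L:16 M:270 C: Current Application Number differs. Replaced Current Application Number',
    'L:2081 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:68',
    'L:2081 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:68',
    'L:2081 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:68',
]
for seq_id, entries in group_by_sequence(summary).items():
    print(seq_id, len(entries))
```

Entries that name no sequence, such as the application-number correction, are collected under the key None.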






<210> 16 
<211> 41 
<212> PRT 

<213> Artificial Sequence 

<400> 16







258 cggcaatcgc agtgcacatg 

261 <210> SEQ ID NO: 6 

262 <211> LENGTH: 23 

263 <212> TYPE: DNA 

264 <213> ORGANISM: Artificial Sequence 

266 <220> FEATURE:

267 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic

270 <400> SEQUENCE: 6

271 cagacaccat taacatcccg aat

274 <210> SEQ ID NO: 7

275 <211> LENGTH: 22 

276 <212> TYPE: DNA 

277 <213> ORGANISM: Artificial Sequence 
279 <220> FEATURE:

280 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic

283 <400> SEQUENCE: 7

284 agaatgaaat gccgaggaag ag
287 <210> SEQ ID NO: 8

288 <211> LENGTH: 1230

289 <212> TYPE: PRT 

290 <213> ORGANISM: RAT 

292 <400> SEQUENCE: 8 

293 Met Cys Pro Pro Gln Leu Phe Ile Leu Met Met Leu Leu Ala Pro Val
296 Val His Ala Phe Ser Arg Ala Pro Ile Pro Met Ala Val Val Arg Arg

300 35 Leu Arg Cys Pro Gly Thr

302 Asp Val Ile Met Ile Glu Ser aH Asn Tyr Gly Arg Thr Asp Asp Lys
305 Ile Cys Asp Ser Asp Pro Ala Gln Met Glu Asn Ile Arg Cys Tyr Leu
308 Pro Asp Ala Tyr Lys Ile Met Ser Gln Arg Cys Asn Asn Arg Thr G^^

311 Cys Ala Val Val Ala Gly Pro Asp Val Phe Pro Asp Pro Cys Pro Gly

314 Thr Tyr Lys Tyr Leu Glu Val Gln Tyr Glu Cys Val Pro Tyr Lys Val
317 Glu Gln Lys Val Phe Leu Cys Pro Gly Leu Leu Lys Gly Val Tyr Gln
320 Ser Glu His Leu Phe Glu Ser Asp His Gln Ser Gly Ala Trp Cys Lys
323 Asp Pro Leu Gln Ala Ser Asp Lys Ile Tyr t" Met Pro Trp Thr Pro
326 Tyr Arg Thr Asp Thr Leu Thr Glu Tyr Ser Ser Lys Asp Asp Phe Ile
329 Ala Gly Arg Pro Thr Thr Thr Tyr Leu Pro His Arg Val Asp Gly









201 gccaaatatt aatgtattta ttagtttatc atatttaatt tttataacac cataaatagg 720
202 tattaatgta cacattttat agatgaggaa aatgtggttc tgagaggtga agcattttgc 780
203 ctagtgatca cagctaaaaa gtgatagagc tgttctttat tttaaagttc acattgtact 840
204 accctggctc cctaatcaca gatgggcagg gtaggggttg ggtggggaca gaagttggag 900
205 agtggatgtg gctgccaacc acacaagttg tgccaaccca cagattgagg aaagatgcta 960
206 aatttggaat ctggcaaacc agtgtttggt tcttagctct gccacttcta agctgtgtga 1020
207 aacttggttg aggtccctaa cttctcctga gggtgaacaa ctcacaaagt tgttttgctt 1080
208 attaaatgtg ataacacctg taaacatcta acagagtgcc tagcacatag cagggatcta 1140
209 gcaattgaat tagggttatt tgtttctgtc tactgattgg gtattgtttc tgacacttac 1200
210 ccaagtgtga atagcctata acactggtat aatttgtgaa atgatgctgc catctagtga 1260
211 aaaccaagac acacacacac acacacacac acacacacat acacacacac gtgcgcgcgc 1320
212 atggacaccc agcttcacca atgacaatat ggattggcat gttttagcct cacaacacag 1380
213 agccctgggg ctaactggca cctagagagg tcatctcggc cagtgccttc caaactacca 1440
214 gtgctgaaaa gccagttcaa aaaattttga acccattgca caccaatatt tttgtgaaat 1500
215 accataaaaa taaattactg gaaaaatgaa ataaaaaata tgtataaaat acaaaccaaa 1560
216 attttagaac tgttagattc aacagcaaaa aattgctgta tacatctctg accaattgct 1620
217 ttcagtttct gtgcttatct ctctacgacc tttgtaacac acagtgaacc agcgctggcc 1680
218 catggataca ctctagtagc cccaatctag ctaaggcagc cccttatagt taatcaatcc 1740
219 tgtcaaacag gaaag^ctgg caaaaccact ggtctgcatg tactttgtcc tttacacaag 1800
220 gaaggatgca aacgtggaaa actgagtgga catggtgttc aggagattga ggctcagcta 1860
221 aattccagct tatttacctg cagttgctta caaagtgttt ggacataatt gtgtaaagct 1920
222 agggtttttt ttctggtttt taaaacaggt aaaggatgtc acagcaccac ttaataacat 1980
223 ttcttctgaa gtccagattt taacatctga tgccaataaa ttaactgctg agaacatcac 2040
224 tagtgctacg cgagtggttg gacagatatt caacacttcc agaaatgctt cacctgaggc 2100
225 aaagaaagtt gccatagtaa cagtgagtca actcctagat gccagtgaag atgcttttca 2160
226 aagagttgct gctactgcta atgatgatgc ccttacaacg cttattgagc aa 2212

229 <210> SEQ ID NO: 4
230 <211> LENGTH: 449
231 <212> TYPE: DNA
232 <213> ORGANISM: Homo sapiens
234 <220> FEATURE:
236 <400> SEQUENCE: 4
237 acagtaaaac ttacctgttg tggtcttttt aatcacctcg tttgagtttt atctgtttct 60
238 ctcctttatt tcccagtcct ctcagaaagt cttcctcaat gtattttgct caggattaag 120
239 aattagataa aacctgttgt ttattattat tcggcataat ggacttggta gtttttctat 180
240 ttttcaatag atttgtactt gaataaggtg aagaatttca cacaacatac aagagtacca 240
241 ttgttcctta tatcgttaaa tctttgtgac acactttgac aaaaatgtag aacctataac 300
242 aaattctttt acaagttact ataaaggaca caaagagaaa actttacctt ccagaacaaa 360
243 atgactcctg atgaacagtg tgtggggatt tgcttgtatg tattaaactt ttgacctctg 420
244 aaaaaaaaaa aaaaaaaaaa aaaaaaaag 449

247 <210> SEQ ID NO: 5
248 <211> LENGTH: 80
249 <212> TYPE: DNA
250 <213> ORGANISM: Artificial Sequence
252 <220> FEATURE:
253 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic
254 oligos
256 <400> SEQUENCE: 5
257 gctgtgcagc gctgagtgcg ttccaggtaa atgtcactaa cagaaaatag tgcagtaagg 60




60 gttgtttttg gaattacctg gattctagca tacctgatgc tagttaatga tgatagcatc 1380 

61 aggatcgtct tcagctacat attctgcctt ttcaacacta cacagggatt gcaaattttt 1440 

62 atcctgtaca ctgttagaac aaaagtcttc cagagtgaag cttccaaagt gttgatgttg 1500 

63 ctatcgtcta ttgggagaag gaagtcattg ccttcagtga cgcggccgag gctgcgtgta 1560 

64 aagatgtata atttcctcag gtcattgcca accttacatg aacgctttag gctactggaa 1620 

65 acctctccga gtactgagga aatcacactc tctgaaagtg acaatgcaaa ggaaagcatc 1680 

66 tag

69 <210> SEQ ID NO: 2 

70 <211> LENGTH: 560 

71 <212> TYPE: PRT 

72 <213> ORGANISM: Homo sapiens 
74 <400> SEQUENCE: 2 



75  Met Glu Thr Tyr Ser Leu Ser Leu Gly Asn Gln Ser Val Val Glu Pro
76    1               5                  10                  15
78  Asn Ile Ala Ile Gln Ser Ala Asn Phe Ser Ser Glu Asn Ala Val Gly
79               20                  25                  30
81  Pro Ser Asn Val Arg Phe Ser Val Gln Lys Gly Ala Ser Ser Ser Leu
82           35                  40                  45
84  Val Ser Ser Ser Thr Phe Ile His Thr Asn Val Asp Gly Leu Asn Pro
85       50                  55                  60
87  Asp Ala Gln Thr Glu Leu Gln Val Leu Leu Asn Met Thr Lys Asn Tyr
88   65                  70                  75                  80
90  Thr Lys Thr Cys Gly Phe Val Val Tyr Gln Asn Asp Lys Leu Phe Gln
91                   85                  90                  95
93  Ser Lys Thr Phe Thr Ala Lys Ser Asp Phe Ser Gln Lys Ile Ile Ser
94              100                 105                 110
96  Ser Lys Thr Asp Glu Asn Glu Gln Asp Gln Ser Ala Ser Val Asp Met
97          115                 120                 125
99  Val Phe Ser Pro Lys Tyr Asn Gln Lys Glu Phe Gln Leu Tyr Ser Tyr
100     130                 135                 140
102 Ala Cys Val Tyr Trp Asn Leu Ser Ala Lys Asp Trp Asp Thr Tyr Gly
103 145                 150                 155                 160
105 Cys Gln Lys Asp Lys Gly Thr Asp Gly Phe Leu Arg Cys Arg Cys Asn
106                 165                 170                 175
108 His Thr Thr Asn Phe Ala Val Leu Met Thr Phe Lys Lys Asp Tyr Gln
109             180                 185                 190
111 Tyr Pro Lys Ser Leu Asp Ile Leu Ser Asn Val Gly Cys Ala Leu Ser
112         195                 200                 205
114 Val Thr Gly Leu Ala Leu Thr Val Ile Phe Gln Ile Val Thr Arg Lys
115     210                 215                 220
117 Val Arg Lys Thr Ser Val Thr Trp Val Leu Val Asn Leu Cys Ile Ser
118 225                 230                 235                 240
120 Met Leu Ile Phe Asn Leu Leu Phe Val Phe Gly Ile Glu Asn Ser Asn
121                 245                 250                 255
123 Lys Asn Leu Gln Thr Ser Asp Gly Asp Ile Asn Asn Ile Asp Phe Asp
124             260                 265                 270
126 Asn Asn Asp Ile Pro Arg Thr Asp Thr Ile Asn Ile Pro Asn Pro Met
127         275                 280                 285
129 Cys Thr Ala Ile Ala Ala Leu Leu His Tyr Phe Leu Leu Val Thr Phe



